Formant estimation for speech recognition
Authors
Abstract
This paper presents a new method for estimating formant frequencies. The formant model is based on a digital resonator, with each resonator representing one segment of the short-time power spectrum. The complete spectrum is modeled by a set of digital resonators connected in parallel. A dynamic programming algorithm produces both the model parameters and the segment boundaries that optimally match the spectrum. We evaluated the method in experimental tests on the TI digit string database. The main results are: 1) the presented approach produces reliable estimates of formant frequencies across a wide range of sounds and speakers; and 2) when the estimated formant frequencies were used in a number of recognition variants, the best setup achieved a string error rate of 4.2% on the adult corpus of the TI digit string database.
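As a rough illustration of the segmentation idea described in the abstract, the Python sketch below splits a sampled power spectrum into contiguous segments by dynamic programming and fits one second-order resonator to each segment. It is not the authors' algorithm. The least-squares fit from three segment autocorrelation terms, the function names (`fit_resonator`, `resonance_hz`, `segment_spectrum`), the `min_width` parameter, and the assumption that bins are evenly spaced from 0 to the Nyquist frequency are all choices made here for illustration.

```python
import math


def fit_resonator(power, lo, hi):
    """Fit A(z) = 1 + a1*z^-1 + a2*z^-2 to bins lo..hi-1 (assumed model).

    Minimises sum_n power[n] * |A(e^{j*w_n})|^2 over the segment and
    returns (error, a1, a2). Bins are assumed to span 0..Nyquist evenly.
    """
    n_bins = len(power)
    r0 = r1 = r2 = 0.0
    for n in range(lo, hi):
        w = math.pi * n / (n_bins - 1)
        r0 += power[n]
        r1 += power[n] * math.cos(w)
        r2 += power[n] * math.cos(2.0 * w)
    det = r0 * r0 - r1 * r1
    if det <= 0.0:  # degenerate segment: no unique least-squares solution
        return math.inf, 0.0, 0.0
    # Solve the 2x2 normal equations [[r0, r1], [r1, r0]] [a1, a2] = -[r1, r2].
    a1 = r1 * (r2 - r0) / det
    a2 = (r1 * r1 - r0 * r2) / det
    return r0 + a1 * r1 + a2 * r2, a1, a2


def resonance_hz(a1, a2, sample_rate):
    """Resonance frequency of 1/A(z) in Hz, or None if the poles are real."""
    if a2 <= 0.0 or a1 * a1 >= 4.0 * a2:
        return None
    angle = math.acos(-a1 / (2.0 * math.sqrt(a2)))
    return angle * sample_rate / (2.0 * math.pi)


def segment_spectrum(power, n_segments, min_width=2):
    """Choose segment boundaries that minimise the summed fit error.

    Returns (total_error, bounds) with bounds = [0, b1, ..., len(power)].
    """
    n = len(power)
    # best[k][j]: minimal error for covering bins 0..j-1 with k segments.
    best = [[math.inf] * (n + 1) for _ in range(n_segments + 1)]
    back = [[0] * (n + 1) for _ in range(n_segments + 1)]
    best[0][0] = 0.0
    for k in range(1, n_segments + 1):
        for j in range(k * min_width, n + 1):
            for i in range((k - 1) * min_width, j - min_width + 1):
                if best[k - 1][i] == math.inf:
                    continue
                cand = best[k - 1][i] + fit_resonator(power, i, j)[0]
                if cand < best[k][j]:
                    best[k][j] = cand
                    back[k][j] = i
    # Trace the stored split points back from the last bin.
    bounds = [n]
    j = n
    for k in range(n_segments, 0, -1):
        j = back[k][j]
        bounds.append(j)
    bounds.reverse()
    return best[n_segments][n], bounds
```

Calling `segment_spectrum(power, 4)` returns the total fit error and the list of bin boundaries; applying `fit_resonator` and `resonance_hz` to each resulting segment then gives one candidate frequency per segment. The triple loop recomputes every fit from scratch, which keeps the sketch short but would be slow for long spectra.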
Similar resources
Statistical Variation Analysis of Formant and Pitch Frequencies in Anger and Happiness Emotional Sentences in Farsi Language
Setting up an emotion recognition or emotional speech recognition system depends directly on how emotion changes speech features. In this research, the influence of emotion on speech features was evaluated for anger and happiness, and the results were compared with neutral speech. The pitch frequency and the first three formant frequencies were used for this purpose. The experimental results showed that there are lo...
A Formant Tracking LP Model for Speech Processing in Car/Train Noise
Formant estimation becomes complicated in the presence of correlated background noise such as car and train noise, because the spectrum of noise from revolving mechanical sources has its own spectral peaks that affect the number and positions of the observed peaks in the noisy speech spectrum. This paper investigates the modeling and estimation of spectral parameters at formants of noisy speech in the...
A formant tracking LP model for speech processing
This paper investigates the modeling and estimation of spectral parameters at formants of noisy speech in the presence of car and train noise. Formant estimation using two-dimensional hidden Markov models (2D-HMM) is reviewed and employed to study the influence of noise on observations of formants. The first set of experimental results presented shows the influence of car and train noise on the d...
Efficient estimation of perceptual features for speech recognition
A number of studies have shown that a pair of perceptual effective formants can be defined to capture most of the phonetic information present in vowels. Various methods of computing the effective formant values have been proposed. However, many of them depend on the accuracy of conventional formant estimation. In this work, we study methods of automatically estimating perceptual effective formants with...
Formant Analysis of Bangla Vowel for Automatic Speech Recognition
To bring new technological benefits to the general public, regional and local language recognition is now drawing the attention of researchers. As with other languages, there is demand for a Bangla speech recognition scheme. A formant is regarded as a resonance frequency of the vocal tract. Formant frequencies play an important role in automatic speech recognition, due to their noise...
On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
A formant analyser is interpreted probabilistically via a noisy channel model. This leads to a robust method of incorporating formant features into hidden Markov models for automatic speech recognition. Recognition equations follow trivially, and Baum-Welch style re-estimation equations are derived. Experimental results are presented which provide empirical proof of convergence, and demonstrate ...
Journal title:
- IEEE Trans. Speech and Audio Processing
Volume 6, Issue
Pages -
Publication date 1998